687 research outputs found

    The role of parametric linkage methods in complex trait analyses using microsatellites

    Get PDF
    Many investigators of complexly inherited familial traits bypass classical segregation analysis to perform model-free genome-wide linkage scans. Because model-based or parametric linkage analysis may be the most powerful means to localize genes when a model can be approximated, model-free statistics may result in a loss of power to detect linkage. We performed limited segregation analyses on the electrophysiological measurements that have been collected for the Collaborative Study on the Genetics of Alcoholism. The resulting models are used in whole-genome scans. Four genomic regions provided a model-based LOD > 2 and only 3 of these were detected (p < 0.05) by a model-free approach. We conclude that parametric methods, using even over-simplified models of complex phenotypes, may complement nonparametric methods and decrease false positives

    Multiple genome-wide analyses of smoking behavior in the Framingham Heart Study

    Get PDF
    BACKGROUND: Cigarette smoking behavior may have a genetic basis. We assessed evidence for quantitative trait loci (QTLs) affecting the maximum number of cigarettes smoked per day, a trait meant to quantify this behavior, using data collected over 40 years as part of the Framingham Heart Study's original and offspring cohorts. RESULTS: Heritability was estimated to be approximately 21% using variance components (VC) methods (SOLAR), while oligogenic linkage and segregation analysis based on Bayesian Markov chain Monte Carlo (MCMC) methods (LOKI) estimated a mean of two large QTLs contributing approximately 28% and 20%, respectively, to the trait's variance. Genome-wide parametric (FASTLINK) and VC linkage analyses (SOLAR) revealed several LOD scores greater than 1.0, with peak LOD scores using both methods on chromosomes 2, 17, and 20; multi-point MCMC methods followed up on these chromosomes. The most robust linkage results were for a QTL between 65 and 84 cM on chromosome 20 with signals from multiple sex- and age-adjusted analyses including two-point LOD scores of 1.30 (parametric) and 1.07 (heritability = 0.17, VC) at 70.51 cM, a multi-point LOD score of 1.50 (heritability = 0.20, VC) at 84 cM, and an intensity ratio of 12.0 (MCMC) at 65 cM. CONCLUSION: Familial aggregation of the maximum number of cigarettes smoked per day was consistent with a genetic component to this behavior, and oligogenic segregation analyses using MCMC suggested two important QTLs. Linkage signals on chromosome 20 between 65 and 84 cM were seen using multiple analytical methods. No linkage result, however, met genome-wide statistical significance criteria, and the true relationship between these regions and smoking behavior remains unclear

    A Latent Model for Prioritization of SNPs for Functional Studies

    Get PDF
    One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many studies of complex diseases, multiple analyses have been completed and integrating these multiple sets of results may be difficult. One may also wish to incorporate biological knowledge, such as whether the SNP is in the exon of a gene or a regulatory region, into the selection of markers to follow-up. In this manuscript, we propose a Bayesian latent variable model (BLVM) for incorporating “features” about a SNP to estimate a latent “quality score”, with SNPs prioritized based on the posterior probability distribution of the rankings of these quality scores. We illustrate the method using data from an ovarian cancer genome-wide association study (GWAS). In addition to the application of the BLVM to the ovarian GWAS, we applied the BLVM to simulated data which mimics the setting involving the prioritization of markers across multiple GWAS for related diseases/traits. The top ranked SNP by BLVM for the ovarian GWAS, ranked 2nd and 7th based on p-values from analyses of all invasive and invasive serous cases. The top SNP based on serous case analysis p-value (which ranked 197th for invasive case analysis), was ranked 8th based on the posterior probability of being in the top 5 markers (0.13). In summary, the application of the BLVM allows for the systematic integration of multiple SNP “features” for the prioritization of loci for fine-mapping or functional studies, taking into account the uncertainty in ranking

    Leveraging Global Gene Expression Patterns to Predict Expression of Unmeasured Genes

    Get PDF
    BackgroundLarge collections of paraffin-embedded tissue represent a rich resource to test hypotheses based on gene expression patterns; however, measurement of genome-wide expression is cost-prohibitive on a large scale. Using the known expression correlation structure within a given disease type (in this case, high grade serous ovarian cancer; HGSC), we sought to identify reduced sets of directly measured (DM) genes which could accurately predict the expression of a maximized number of unmeasured genes

    Comparison of tagging single-nucleotide polymorphism methods in association analyses

    Get PDF
    Several methods to identify tagging single-nucleotide polymorphisms (SNPs) are in common use for genetic epidemiologic studies; however, there may be loss of information when using only a subset of SNPs. We sought to compare the ability of commonly used pairwise, multimarker, and haplotype-based tagging SNP selection methods to detect known associations with quantitative expression phenotypes. Using data from HapMap release 21 on unrelated Utah residents with ancestors from northern and western Europe (CEPH-Utah, CEU), we selected tagging SNPs in five chromosomal regions using ldSelect, Tagger, and TagSNPs. We found that SNP subsets did not substantially overlap, and that the use of trio data did not greatly impact SNP selection. We then tested associations between HapMap genotypes and expression phenotypes on 28 CEU individuals as part of Genetic Analysis Workshop 15. Relative to the use of all SNPs (n = 210 SNPs across all regions), most subset methods were able to detect single-SNP and haplotype associations. Generally, pairwise selection approaches worked extremely well, relative to use of all SNPs, with marked reductions in the number of SNPs required. Haplotype-based approaches, which had identified smaller SNP subsets, missed associations in some regions. We conclude that the optimal tagging SNP method depends on the true model of the genetic association (i.e., whether a SNP or haplotype is responsible); unfortunately, this is often unknown at the time of SNP selection. Additional evaluations using empirical and simulated data are needed

    A Bayesian approach for applying Haseman-Elston methods

    Get PDF
    The main goal of this paper is to couple the Haseman-Elston method with a simple yet effective Bayesian factor-screening approach. This approach selects markers by considering a set of multigenic models that include epistasis effects. The markers are ranked based on their marginal posterior probability. A significant improvement over our previously proposed Bayesian variable selection methodology is a simple Metropolis-Hasting algorithm that requires minimum tuning on the prior settings. The algorithm, however, is also flexible enough for us to easily incorporate our hypotheses and avoid computational pitfalls. We apply our approach to the microsatellite data of Collaborative Studies on Genetics of Alcoholism using the coded values for the ALDX1 variable as our response

    Assessment of genotype imputation methods

    Get PDF
    Several methods have been proposed to impute genotypes at untyped markers using observed genotypes and genetic data from a reference panel. We used the Genetic Analysis Workshop 16 rheumatoid arthritis case-control dataset to compare the performance of four of these imputation methods: IMPUTE, MACH, PLINK, and fastPHASE. We compared the methods' imputation error rates and performance of association tests using the imputed data, in the context of imputing completely untyped markers as well as imputing missing genotypes to combine two datasets genotyped at different sets of markers. As expected, all methods performed better for single-nucleotide polymorphisms (SNPs) in high linkage disequilibrium with genotyped SNPs. However, MACH and IMPUTE generated lower imputation error rates than fastPHASE and PLINK. Association tests based on allele "dosage" from MACH and tests based on the posterior probabilities from IMPUTE provided results closest to those based on complete data. However, in both situations, none of the imputation-based tests provide the same level of evidence of association as the complete data at SNPs strongly associated with disease

    Changes in Physical Activity after Sacrocolpopexy for Advanced Pelvic Organ Prolapse

    Get PDF
    To describe changes in physical activity one year after sacrocolpopexy for pelvic organ prolapse
    • …
    corecore